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STATISTICAL APPROACHES TO DETECTION AND 
QUANTIFICATION OF A TREND WITH RETURN-ON- 
INVESTMENT APPLICATION 

D. P. Gaver 
P. A. Jacobs 



1. INTRODUCTION 

Members of a group of items such as aircraft, ships, tanks, etc use one or more 
subsystems of a particular type. These subsystems have basic design characteristics 
which result in specific values for measures of reliability, maintainability, 
repairability, etc. The values of these measures have a tendency to change, usually 
for the worse, over time. The time of onset of a degrading trend and the magnitude 
of the trend are unknown and must be estimated from data. 

The evidence of the degradation of a subsystem suggests the possible 
economic and operational value of a subsystem upgrade, either by redesign of the 
existing subsystem or replacement with a new subsystem. The decision to upgrade a 
subsystem will, at least partially, be based on a comparison of the costs of remaining 
with the current subsystem and those of investing in the upgraded subsystem 

The purpose of this report is to present preliminary models to assist in the 
assessment of the cost benefits of upgrading a subsystem in light of noisy data 
concerning its performance. 

One part of the model is for the detection and qualitative description of a 
possible trend in noisy data. Two preliminary formal mathematical models are 
presented. One model is presented in Appendix A. Appendix A also describes 
maximum likelihood procedures to estimate the time of onset of system 
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degradation and the magnitude of the trend for one version of the general problem 
described. It turns out that this general problem type has been recognized early and 
studied by many under the name of changepoint problems; see Carlin, Gelfand and 
Smith (1992) for a very recent review of a certain style of approach, plus many 
references. The present exposition is self-contained and is directed specifically at an 
economic choice problem that potentially arises frequently in military logistics and 
procurement. A Bayesian model is presented in Appendix D. Appendix D also 
describes the Bayesian estimation procedure. 

A second part of the overall model is a cost model which will have as input 
the estimates of the time of onset of system degradation and the magnitude of the 
trend. A simple cost model is presented in Appendix B. The cost model includes a 
fixed cost for upgrading the system as well as costs for each failure for both the 
current and upgraded subsystem. 

Appendix C presents the results of a simulation experiment using simulated 
data to illustrate the type of information that can be obtained concerning the cost 
effectiveness of upgrading a subsystem using the model of Appendix A in the light 
of the uncertainty of the time of onset of system degradation and the magnitude of 
the degradation; the maximum likelihood procedure of Appendix A is used to 
estimate the parameters. Appendix D contains results of the Bayesian analysis of the 
same simulated data and compares the results of the two procedures. 

In the following Section we informally discuss the general model in greater 
detail. In Section 3 we discuss an example and the results of the simulation 
experiments. Finally, in Section 4 we present conclusions. 
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2. THE MODEL 



To give a concrete feel for the data that we are considering examine the 
following illustration. The x's represent actual data ( e.g. total failures in a month), 
while the dotted lines represent the true, but hidden trend. 



Demand 



X 




1 234 56789 10 11 12 Time 



It is plain that there is little evidence of any change in the demand -level until 
possibly time t=9, when a retrospective look suggests that a change took place at 
about t=6 or 7. Successively more confirmation is given by observations at t=10 and 
11, etc. The human eye picks up this new trend rather quickly (the human brain 
should remain skeptical about its permanence, and questioning concerning its 
magnitude and eventual level). If the trend continues as suggested, greater and 
greater confirmation of its direction and magnitude becomes available; this is 
quantified by the preliminary mathematical models and statistical methods 
described in Appendices A and D. The statistical methods provide estimates of the 
true trend (denoted by the dotted line above) based on the number of failures 
observed (the x's above); the estimates are of the time of onset of system 
degradation and the magnitude of that degradation. As with the human eye, these 
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estimates become better as more confirmation is received concerning the 
permanence of the trend. 

A unique feature of problem described in the Introduction is that the 
statistical models and estimation procedures are only part of the story. The problem 
also includes a comparison of the cost of remaining with the current subsystem and 
the cost of upgrading it. These costs will depend on the estimates of the time of 
onset of subsystem degradation and the magnitude of that degradation. Appendix B 
presents a simple cost model which depends on these estimates. The model is as 
follows. The current subsystem has a fixed cost for each failure. The upgraded 
system has a (large) known initial fixed cost for the upgrade and then a fixed cost for 
each failure. In this simple model it is assumed that the upgraded subsystem has a 
known fixed failure rate and a known cost of repair/replacement. Further, it is 
assumed that the subsystem can be upgraded in one time period. After the initial 
fixed cost, , the mean cost of the upgraded system in each time period is a constant 
cost Cf^ multiplied by the (known) mean number of failures in each period. 
Informally, the cost model for the current subsystem is a (known) constant cost Cq 
multiplying the true trend; (the dotted line in the above picture). However, since 
the true trend is unknown, for each time t an estimate of the cost of the current 
subsystem will be computed by multiplying Cq by the estimate of the true trend 
obtained from data accumulated up to that time t. Estimates of the future cost of the 
current subsystem are computed by multiplying by the projected estimated trend. 
Thus, a decisionmaker will be comparing the (known) future cost of upgrading the 
subsystem with an estimated future cost of the current subsystem. 

Since the estimates of the true trend have variability, the estimated future 
cost of the current system will also have variability. As with the estimates of the 
true trend, one can expect the estimates of future subsystem cost to be quite variable 
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until sometime after the onset of degradation; this variability is due to uncertainty 
in the estimates of the true trend and the time of onset. It is important to consider 
this uncertainty in the assessment of whether or not to upgrade the current 
subsystem. For example, it may be that the estimated mean future cost of the 
current subsystem is larger than that for the upgraded subsystem but that the 
uncertainty associated with the estimated mean future cost of the current subsystem 
is high. This may indicate that it is better to wait to accumulate more information 
concerning the apparent degrading trend before deciding to invest in the upgrade. 

3. RESULTS 

An example with simulated data is presented in Appendix C. The complete 
simulated data set appears in Figure 1. The true time of onset of degradation is at 
time 10. Before time 10 the true mean number of failures in each time period is p=4. 
After time 10 the true trend is linear with a slope of ti=1.5. The true variability of 
the data about the true trend line is 0^=1 for each time period. 

The cost model for the example has the following features. The fixed cost per 
failure for the current subsystem is Co=2. There is a fixed initial cost Cp=225 for 
upgrading the subsystem. The upgraded subsystem will have a lower mean number 
of failures, X=2, in each time period but a higher cost per failure, Cf^=l2, than the 
current subsystem before the onset of degradation. After onset of degradation, the 
failure rate of the current subsystem may become larger than that of the upgraded 
subsystem due to the linear trend. The larger failure rate may offset the fixed cost of 
upgrading and make it economical to upgrade. There is also a time horizon, H=30, 
during which this subsystem or its upgrade will be used. If the onset of degradation 
is too close to the end of this time horizon, then because of the fixed upgrade cost, 
Cyr, it will not be cost effective to upgrade the subsystem. The decision to upgrade 
depends on the estimates of the time of onset of system degradation and of the 
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magnitude of the trend. The assessment of the cost of upgrading should reflect the 
uncertainty of these estimates. 

For each time t>5 the following policies are considered: upgrade the 
subsystem at each future time until the time horizon; all potential upgrading times 
from the present time until the time horizon H are considered; that is, if the current 
time is t=ll then the policies that would upgrade the subsystem at time 11, time 
12,..., time 29, (which is H-1), are considered. For each current time t, the (estimated) 
costs of these p>olicies are compared to the (estimated) cost of never upgrading the 
subsystem. The "optimal" (minimum estimated mean cost) policy can then be 
found. 

For comparison purposes the following is a description of the minimum 
mean cost policy in the (unrealistic) case in which the true trend in the data is 
known at each current time t. However, the decisionmaker is not omniscient and if 
the current time is before the onset of subsystem degradation, she will not know 
that this will occur. If at each time t, the correct trend for that time were known, 
then the minimum average cost policy would stay with the old subsystem until 
time 10. At time 10 (the time of onset of subsystem degradation), the decisionmaker 
instantly knows that change has occurred and the magnitude of the adverse trend so 
she can determine that the best policy is to upgrade the system at time 15. Suppose, 
however, that the decisionmaker becomes omniscient at a time after the time of 
onset of subsystem degradation (at time 10) and has not upgraded the subsystem. In 
this case for current times 11-15, the best policy is to upgrade the subsystem at time 
15. For current times 16-23, the best policy is to upgrade the subsystem immediately. 
For current times greater than 23 the best policy is never to upgrade the subsystem 
(the cost of upgrading exceeds the advantage). 
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For each time t>5, the simulation experiment of Appendix C considers the 
data accumulated up to time t and using the data as of that time estimates the time 
of onset of system degradation and the magnitude of the trend For each current 
time t, the estimated mean cost for each policy to upgrade the subsystem at some 
future time is computed using the current estimates of the trend. 

Figure 2 presents the times to upgrade the subsystem which correspond to the 
minimum estimated mean cost policies for each current time. On the x-axis appears 
the "current" time. On the y-axis is the time to upgrade corresponding to the 
minimum estimated mean cost policy. If the minimum estimated mean cost policy 
is never to upgrade, then the time to upgrade is set equal to the horizon time, H=30. 
The following is a verbal description of these optimal policies. If the current time is 
either 5 or 6, the best policy is never to upgrade the system. Note that for the current 
time 7 the best policy based on the current estimates of the trend is to upgrade the 
system at time 9. An examination of the data in Figure 1 shows that around time 7 
there is the local appearance of a positive slope. Hence, locally this policy is not 
unreasonable. However, the additional data point at time 8 results in updated 
estimates which indicate that the best policy at the current time 8 is never to 
upgrade. The best policy for current times 9-11 is never to upgrade. The best policy 
at current time 12 is to upgrade the subsystem at time 15. The best policy at time 13 
is to upgrade at time 14. The best policy at time 14 is to upgrade at time 15. The best 
policy at time 15 is to upgrade immediately. The best policy at times 16-23 is to 
upgrade immediately. The best policy for current times larger than 23 is never to 
upgrade. Hence, except for current time 7, the optimal policy using the estimated 
projected future mean costs agree fairly well with the optimal policy for the case in 
which the true trend is known. This suggests that the estimation procedure of 
Appendix A requires some patience: one should not upgrade the system the first- 
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time that an upgrade is indicated, but let some time elapse for confirmation. One 
would not expect as close an agreement if the variability of the data about the true 
trend were larger since it would become more difficult to estimate the time of onset 
of the degradation and the magnitude of the degradation. 

So far we have considered only point estimates of the mean cost of each 
policy. It may be that the variability of the estimated cost of each policy will yield 
more information. In Appendix C the variability of the mean policy costs computed 
using the estimation procedure of Appendix A is assessed using the computer 
intensive technique of bootstrapping. A brief description of the technique appears 
in that Appendix. Selected results are presented graphically in Figures 3-8. Each 
figure corresponds to a different current time and presents boxplots of bootstrap 
replications of the difference in estimated cost between a policy the upgrades at each 
future time and the policy that never upgrades. A description of the boxplot can be 
found in Appendix C; one can think of it as similiar to a very terse histogram. The 
y-axis represents the possible values of the cost differences. The x-axis represents the 
different possible times to upgrade. Since we are subtracting the cost of the policy of 
never upgrading from the cost of each policy which upgrades at a time in the future, 
a box and its appendages that correspond to negative values indicate that the 
estimated costs of the policy to never upgrade are higher than those for a policy that 
upgrades and hence it is better to upgrade the subsystem. Informally, the width of 
the box and its appendages are an indicator of the variability of the estimated cost 
differences between a policy which upgrades at a future time and the policy which 
never upgrades; the wider the box the more variable the estimated cost differences. 
One would expect the width of the boxes to be large around the time of onset of 
subsystem degradation because of the large uncertainty of the estimates. The circle 
in each box represents the mean; the line indicates the median. 
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Figure 3 presents boxplots of the mean cost differences for current time t=7 for 
all possible policies; the leftmost boxplot presents the cost differences for the policy 
of upgrading immediately at time 7; the next boxplot to the right presents the mean 
cost differences for the policy of upgrading the subsystem one time unit later, at time 
8,etc. The boxplots of Figure 3 indicate that it is better to upgrade the subsystem 
almost immediately; the "best" time to upgrade is around time 9, but the sensitivity 
to the precise upgrade time is low. The spread (width of the boxes) of the cost 
differences is high and there appears to be not much difference between upgrading 
the subsystem immediately or waiting until time 14. Hence, the assessment of the 
variability of the estimated mean costs is providing the decisionmaker with the 
ability to compare different policies in light of the uncertainty of the estimates of the 
degrading trend. This additional comparison may prevent a premature decision to 
upgrade. Notice that bootstrapping does away with the instability that may result 
when the simple point estimate is used, i.e. Figure 2. 

Suppose that the decision maker actually delays upgrading until later, either 
because she is still gathering information, she is concerned about the variability of 
the cost estimates, or because a rule tells her to wait. Figure 4 presents a similiar plot 
for the subsystem at current time 10; note that since the boxes of the cost differences 
correspond to positive values, the boxplots now indicate that the best policy is never 
to upgrade. Figure 5 presents boxplots for the policies evaluated at time 12; there is 
an indication that it is better to upgrade the subsystem; however, there appears to be 
little difference between upgrading at time 12 or at any time until time 17. Figure 6 
presents boxplots for current time 15; there is a clear indication that one should 
upgrade the subsystem either immediately or in the next time period. Figure 7 
presents the boxplots for the system at current time 18; there is an indication that 
one should upgrade immediately. Figure 8 presents the results for current time 25; 
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here the best policy is never to upgrade. Notice that as the decisionmaker 
accumulates more information concerning subsystem degradation, the estimates of 
the magnitude of the degradation are becoming better and the variability of the 
estimates for policy costs is becoming less. 

Figures 9-11 present results of using the Bayesian analysis presented in 
Appendix D to obtain information concerning the cost effectiveness of switching to 
the new system. The same data set which illustrated the procedures in Appendix C 
is used; the data are presented in Figure 1. Since the bivariate normal has 5 
parameters to be estimated, the estimation procedure begins with data 

Figure 9 presents the times to upgrade the subsystem which minimize the 
expected posterior mean cost for each decision time t=6,...29. On the x-axis appears 
the “current" time t. On the y-axis is the time to upgrade corresponding to the 
minimum expected mean cost policy using the posterior distribution given data 
Comparison with Figure 2 indicates the following differences between the 
optimal Bayes policies and the optimal maximum likelihood (ml) policies presented 
in Appendix C. The Bayes policy for t=7 is still to upgrade; however, the time to 
upgrade is later (time 13) than the ml policy (which says to upgrade at time 9). The 
Bayes policy at time 12 is to upgrade at time 16 rather than time 15 for the ml policy. 
The Bayes policies and the ml policies are the same for the other times. Thus, the 
Bayes policy is more conservative than the maximum likelihood policy when there 
is a change in the policy from never upgrading to upgrading. 

Simulation is used to obtain information concerning the variability of the 
posterior distribution of the average cost of each policy. For each current time t, a 
realization of the model is simulated from the posterior distribution and the 
average costs for each policy computed. Figures 10-11 present boxplots of 100 
replications of the simulated difference in estimated cost between a policy that 



10 



upgrades at each future time and the policy that never upgrade. The y*axis 
represents the possible values of the cost differences. The x-axis represents the 
different possible times to switch. 

Figure 10 presents the boxplots for simulated average policy cost differences 
using the posterior distribution at the current time of t=7. Figure 3 presents a 
similiar picture for bootstrap replications of estimated average policy cost differences 
using the maximum likelihood estimates of Appendix C at t=7. Comparison of the 
two figures indicates that the Bayes estimates of average cost difference are much 
larger and can be positive some of the time; recall that a positive difference implies 
that it is better never to upgrade. This behavior may be due to the Bayes procedure 
assessing greater variability to the estimated time of the onset of subsystem 
degradation than the procedure of Appendix A. Hence, the Bayes estimates are 
providing much less evidence of the need to upgrade. Recall that the onset of 
degradation does not occur until time 10. Hence, if the true model parameters were 
known, the best policy at time 7 (without omniscence) would be never to upgrade. 

Figure 11 presents results for current time t=12. Comparing this figure with 
the corresponding maximum likelihood figure. Figure 5, indicates that there is little 
practical difference between the two procedures in this case also. However, the 
widths of the boxes for the Bayesian procedure are larger than those for the 
maximum likelihood procedure. The greater widths are an indication of greater 
uncertainty concerning the future average costs for each policy. As a result, the 
Bayesian procedure is providing less evidence of the need to upgrade. This greater 
variability is once again probably due to the Bayes procedure assessing greater 
uncertainty to the time of onset of degradation. This suggests that the Bayesian 
procedure may be more cautious than the maximum likelihood procedure. 
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4. CONCLUSIONS 

Preliminary mathematical models have been formulated for the possible 
onset and growth in subsystem degradation. The model recognizes that the time of 
onset of a degrading trend may be random, and hence initially unknown, and that 
the trend magnitude is also initially unknown. The trend magnitude will become 
better known as more data is accumulated. Statistical procedures have been 
developed to estimate the time of onset and the trend magnitude. A rudimentary 
cost model has been used to develop procedures (which recognize the uncertainty 
concerning the time of onset and trend magnitude) to determine estimated costs 
and the associated risks cA upgrading the subsystem at different times in the future. 
An experiment using simulated data gives reasonable results and indicates that the 
consideration of variability in policy costs due to uncertainty concerning the time of 
onset and trend magnitude can lead to wiser decisions. 
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APPENDIX A 



STATISTICAL MODELS AND ESTIMATION PROCEDURES 



In this Appendix we present the statistical model and an estimation procedure. 



1. MATHEMATICAL FORMULATION OF A STATISTICAL MODEL 

We wish to use a sequence of fluctuating numerical values as a mathematical 
model for demand for a particular system during successive periods. Thus, consider 
a sequence of random variables with the following structure: 

a) X], X2, X3, ... Xc are identically and independently distributed, while 

b) Xc+i/ Xc+2/ ••• exhibit a linear trend. The time of onset of subsystem 
degeneration , C, called the changepoint, will realistically be unknown, as will the 
magnitude of the linear trend. 



Example 1: 



X, l<i<C; 

~N{u + {i-C)T],G^), C + l<i. 



(A.l) 



This is shorthand for the assumption that X,- is normally/Gaussianly distributed 
with mean ^ and variance up to the changepoint time C, and is normally 
distributed thereafter, with variance but with mean that grows — if the slope, 77 , is 
positive — linearly thereafter; p is the mean number of failures in each time period 
before the onset of degradation; C is the time of onset of degradation; Ti is the slope 
of the linear trend after degradation; and the variance is a measure of the 
variability of the actual number of failures about the true mean. This model should 
be appropriate for items whose mean demand /failure rate per time period, e.g. 
month, is reasonably large, but whose variance is relatively unchanged when and if 
a change in the mean occurs. 
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a :, ~ Po{^i), 



0 < i < C; 



Example 2: 



(A.2) 



- Poi^i + (i - C)ri), C + l<i 



meaning that X,- is Poisson distributed with constant mean up to C, but thereafter 
has a linear trend. This model is most appropriate when the basic demand rate is 
small. Many other models are even more appropriate; for instance the Negative 
Binomial; Correlated or time-dependent demands may also occur. Attention to all 
of these is postponed. The most trustworthy model is likely to be based on some 
actual data. We are in the process of assembling such data. 

2. LIKELIHOOD ESTIMATION 

Suppose observations of the variables (numbers of failures during time periods 
l,...,t) X], X2, ..., Xt are available; denote them by x-\, X2, ... xt- Then the likelihood 
function for the unknown parameters, C, r\, is as follows for Example 1: since 
the number of failures in successive time periods are independent, for 1 < C < /, and 
letting data = ix\, X 2 , ..., Xf) 







(A.3) 



so the log-likelihood is 



data) = 5;-|(jc,-^ f / -j{x,-n-{i-C)Tif / 




-—In (7^ -I- constant. 
2 



(A.4) 



This can be concisely written as 
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where 



{i-cr 



i-C 

0 



if i^C, and 
if i < C. 



(A.6) 



Note that the above applies if there is a changepoint within the range of 
observation; otherwise, if C > f then 



data) = n— ^ 



(A.7) 



and 

Tj, (T^, data^=^-—{x, - /if / - — \no^ + constant. (A. 8) 

Now in the following hold C fixed and behave as if it were known and the objective 
is to maximize / with respect to /i, rj, and cr^. Begin by differentiating with respect to 

|^ = £-(j:,-/i)(-l)/a'+ £-(a:,-/i-(/-C)T/)(-l)/a^, 0<C<r; (A.9) 

1=1 i=C+l 



= At )(-!)/ cr^, if C>t. 

1=1 



These expressions can be simplified and combined: 



(A.IO) 
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iL 



t-C 



£x(r)-r/i-7]£y 



CT^ 

= rx(f)-r/i 



(r-Cr-h(i-C) 

2 



T\ for r > C 
for r < C 



(A.ll) 



i 

where 3c(r) = 



1-1 



Rewrite this as 






((i-cr)%(i-cr 



where 



(f-cr = 



f-C ift>C; 
0 if f < C. 



(A. 12) 



(A. 13) 



If the derivative is set equal to zero we obtain the first “normal equation' 



H+Wx{C,t)r] = x{t) 



where here 



V/,(C.r) = j 



((r-C)*f +(r-0* 



(A.14) 



(A.15) 



Next differentiate with respect to r\: for t^C: 
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(A.16) 



where 



4= i-(^,-/--(i-C)r|)(-(i-C)) 

|=C+1 



i-C i-C 



y=l ;=1 



i=C+l 



I 



|=C+1 






>=1 



(A.17) 



Set the derivative equal to zero to obtain the second normal equation 
tl/^{C,t)^i+y/^{C,t)7] = - Jo:.(/-C)hJ2(C,0 

^ i=C+l 

Differentiate with respect to 



d£ 



= (^< - - (/ - c)* 



2 



{^) 



if this is equated to zero and solved for there results 

« .. m 2 



^ 1=1 



(A.18) 



(A.19) 



(A.20) 



Now solve the first two normal equations for the maximum likelihood estimate, 
conditional on C; the result is: 

P(C) = J6£:lV^ (A.21) 

¥2 - ¥i 

(A.22) 

¥2 - ¥i 

for C<t; for C>t, fi (C)=J, t)(C)= 0. These can now be substituted into (A.20) to obtain 
the maximum likelihood estimate for in terms of the other estimates, all 
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conditional on the value of C. Finally substitute the above estimates into the 
expression for the negative of the log likelihood: 



2 

-y/(/i(C,r),C, ^(C,t),^(C,r);data) s 5(C;data) 

^ ly , 1 y (x,-fiiC,t)-{i-C)fi{C,t)f 

th ^(C.r) 

+ lnCT^(C,r) (A.23) 

= 1 + lno^ (C.r) 

and obtain the value of C that mirumizes S(C;data) over the range (l,2,...,t); denote 
this by C(t); the last equality in the above expression follows from the definition of 
a^(C,t) given by (A.20). Thus, the estimate of C is chosen to minimize the sum of 
the squared residuals. If the minimum of S(C;data) occurs at t=C, then the 
conclusion is that no change has occurred in [0,t]. Note that all estimated parameter 
values, namely and depend upon the C value in use, and so the dependence 
of S upon C involves that implicit dependency. Once C(t) is developed this value is 
substituted into the expression for and a to obtain the maximum likelihood 
estimates of those parameters. Note: there are other procedures for estimating the 
changepoint. One is explored in Appendix D. Others will be investigated in later 
work. 
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APPENDIX B 



A MATHEMATICAL MODEL FOR COST AND RETURN-ON-INVESTMENT 
The statistical model discussed in Appendix A is one part of the problem. In 
this Appendix we describe a simple cost model that links with the trend estimation 
procedure. 

Suppose there is a cost cq incurred each time a subsystem fails; the total 
average cost incurred during the first t time periods is 



Q(0 = 



Co/ifr + l) 



x=0 



ifC>t, 

ifC<r 



(B.l) 



[=[Co/i + T7‘P/C,0](t + l) 

where p is the constant mean number of failure before the onset of subsystem 
degradation, C is the time at which the mean number of failures begins to show 
linear degradation, (trend), and ri is the magnitude of that linear trend. Note that 
the parameters C, p, and ri are all unknown but may be estimated from data. 

Suppose it is possible to upgrade the current system either by redesign of the 
existing system or replacement with a new system. The "new" system has a known 
constant mean number of failures in each time period X and an average cost per 
unit failure of . There is also a fixed cost Cp of changing to the new system. The 
total average cost of using the new system for t time units is 

C,(0 = c^ + c^At (B.2) 

Assume there is a planning horizon H during which the parent system will 
be operative; when the horizon is reached all (remaining) parents are stored or 
disposed of. Note that we do not consider salvage costs in this treatment; they can 
be introduced if desired. At each time t one can compare the future cost of the 
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current system to that of the new system and choose to change to the new system if 
it has a lower mean future cost; that is, at time t the following decision can be made: 

1) if Co(//)-Co(0<C;, + C;,A(//-r) (B.3) 

then keep the current system; 

2) if Co(//)-Co(r)>C;. + c^A(//-0 (B.4) 

then switch to the new system. 

Since the parameters |i, C, and are unknown, the estimation procedures 
described in the previous sections can be used to estimate them; the estimated 
average cost of the current system can then be obtained by computing Qft) using the 
estimates. If desired, alternative estimation procedures can be utilized. There are 
many such, and selection should be based on tests with real data. 
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APPENDIX C 



A SIMULATION EXPERIMENT 

In this Appendix we desaibe a simulation experiment to illustrate uses of the 
model of Appendices A and B. The random numbers were generated using 
LLRANDOMH, cf, Lewis and Uribe (1981). 

A data set of length 30 ( e.g. 30 months) is generated from model (A.l) with 
parameters p=4, t|=1.5, 0^=1, C=10; that is, data x,,...,X 3 o are generated using (A.l) 
with the above parameters. The planning horizon is H=30. The cost per unit failure 
for the old system is cq = 2 ; the fixed cost for changing to the new system is Cp=225; the 
cost per unit failure for the new system is c^=12; and the mean number of failures 
in each time period for the new system is A =2. Thus the new system has a lower 
failure rate than does the old system initially, i.e. before degradation sets in at 
(unknown) time C, but the cost per failure for the new system is higher. 

The following is a description of the optimal policy for each time t if the 
change to the new system has not occurred yet and the correct parameters are 
known for time t; if C has not yet occurred, then the policy decision uses only p and 
not C or T| for the decisionmaker is unaware that the system will degrade in the 
future. If at each time t, the correct parameters for that time were known, then the 
minimum average cost policy would stay with the old system until time 10. At time 
10 (the changepoint), the decisionmaker instantly knows that change has occurred 
and the magnitude of the adverse trend so he can determine that the best policy is to 
change to the new system at time 15. Suppose, however, that the decisionmaker 
becomes omniscient at a time after the changepoint (at 10). For times 11-15, the best 
policy is to change to the new system at time 15. For times 16-23, the best policy is to 
change to the new system immediately. For times greater than 23 the best policy is 
never to change to the new system (the cost of change exceeds the advantage). 
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Now consider the decision maker in the simulated environment, with 
realistic deviation from the average trend. For each time t using the data the 

following calculations are performed. Starting with t=5, estimates of and C 

using (A.20)-(A.22) and the procedure using (A. 23) are obtained using noisy 
simulated data denote the resulting estimates by and C(t). The 

estimated future mean cost of a policy that switches to the new system t time units 
in the future is computed for x=0,...,30-t; that is. 



Cq/5(t + 1) + Cp + CjyA(30 — (t + t)) 



if C > r 









J=0 



+ Cyr +C/vA(30-(r + T)) if C<r 



(Cl) 



is computed for each x and the minimum cost C„{t) = min C^(T,t) computed. This 

minimum cost is compared to the cost of doing nothing (and remaining with the 
current system) which is 



Co/i(30-(r- 1)) ifC>r, 

. 30-1 

Co(/i(30 - (r - D) + X n((r - cy + s)) if C < r 



(C.2) 



The policy associated with the minimum cost is chosen; in expressions (C.l) and 
(C.2) the estimates are jl = /}(/), etc.,for time t. 

Figure 1 presents the simulated data set. Figure 2 presents the optimal 
policies computed using (C.l) and (C.2) with the estimates using data for each 

time t>5; these computations assume that the decision maker is still getting 
information and that the change to the new system has not yet occurred. On the x- 
axis appears each time the best policy is computed. On the y-axis is the best policy's 
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time to change to the new system. The policy of never changing to the new system 
is represented by setting the time to change to the new system equal to the horizon 
H=30. The following is a description of the results. For times 5-6, the best policy is 
never to change to the new system. Note that at time 7 the best policy based on the 
current estimates is to change to the new system at time 9. An examination of the 
data in Figure 1 shows that around time 7 there is the local appearance of a positive 
slope. Hence, locally this policy is not unreasonable. However, the additional data 
point at time 8 results in updated estimates which indicate that the best policy at 
time 8 is never to change. The best policy for times 9-11 is never to change. The best 
policy at time 12 is to change to the new system at time 15. The best policy at time 13 

t 

is to change at time 14. The best policy at time 14 is to change at time 15. The best 
policy at time 15 is to change immediately. The best policy for times 16-23 is to 
change immediately. The best policy for times larger than 23 is never to change. 
This suggests that the current way of estimation requires some patience: one should 
not change to the new system the first-time that a change is indicated, but let some 
time elapse for confirmation. 

Resampling or "Bayesian Bootstrapping" 

A re-sampling technique called the bootstrap id. Efron et al. (1986)) can be 
used to assess the variability of the estimated mean cost associated with a policy due 
to the uncertainty of the parameter estimates. For each time t, 100 bootstrap 
replications x^{b;t),...,x,ib;t), b=l,...100, are generated using model (A.l) with the 
parameter estimates pit),fjit),ait), andCit). For each bootstrap replication the 
parameters p, Ti, cr^, and C are re-estimated obtaining pib;t),f}(b;t),dib;t), and C(f?;r). 
For each bootstrap set of estimated parameters the future mean cost of a policy that 
switches to the new system r time units into the future C^ib‘,v,t) is computed 
using (B.l). The cost of never changing, C^^ib;t) , is also computed using (B.2). 
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Figures 3-8 present boxplots for the differences in costs between the policy 
which says to change to the new system at time t+x and the policy which says never 

A A 

to change, C^{b;r,t)-C^^(b,t), for x=0,...,30-t for different times t for each bootstrap 
replication. The x-axis displays the time to change to the new system,t+x, for each 
policy. The y-axis displays the cost differences. A negative value of the cost 
difference indicates that it is better to switch to the new system; the more negative, 
the greater the estimated mean advantage of changing to the new system. 

All the graphical displays are produced by GRAFSTAT, a developmental 
product of IBM which the Naval Postgraduate School is using under a test 
agreement with IBM. The following description of the boxplot is taken from the 
documentation of GRAFSTAT. "The box portion of the plot extends from the lower 
quartile of the sample to the upper quartile. (The lower quartile is the point for 
which one quarter of the sample lies below and three quarters above. The upper 
quartile is analogous.) The line across the center of the box marks the median. The 
circle in the box represents the mean. 

The distance from the lower to the upper quartile is called the interquartile 
distance and it will be represented by Q. The points at the ends of the two lines 
(called whiskers) are the smallest and largest points, respectively, within 1.5Q of the 
quantiles. The points beyond the whiskers are outlying values." 

Figure 3 presents boxplots of the mean cost differences for t=7 for all possible 
policies; the leftmost boxplot presents the cost differences for the policy of changing 
immediately at time 7; the next boxplot to the right presents the mean cost 
differences for the policy of changing to the new system one time unit later, at time 
8,etc. The boxplots of Figure 3 indicate that it is better to switch to the new system 
almost immediately; the "best" time is around t+x=9, but the sensitivity to the 
precise change time is low. The spread of the cost differences is high and there 
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appears to be not much difference between switching to the new system 
immediately or waiting until time 14. Notice that bootstrapping does away with the 
instability that may result when the simple maximum likelihood estimate is used, 
i.e. Figure 2. 

Suppose that the decision maker actually delays change until later, either 
because she is still gathering information or because a rule tells her to wait, and she 
agrees. Figure 4 presents a similiar plot for the system at time 10; note that since the 
cost differences are positive, the boxplots now indicate that the best policy is never to 
change. Figure 5 presents boxplots for the policies evaluated at time 12; there is an 
indication that it is better to switch to the new system; however, there appears to be 
little difference between switching at time 12 or at any time until time 17. Figure 6 
presents boxplots for time 15; there is a clear indication that one should switch to the 
new system either immediately or in the next time period. The plot of Figure 7 
presents the boxplots for the system at time 18; there is an indication that one 
should switch to the new system immediately. Figure 8 presents the results for time 
25; here the best policy is to stay with the old system until the end of the time 
horizon H=30. 

The boxplots can be interpreted as representing an approximate Bayesian 
posterior density for the true expected or mean cost, given observations up to time t. 
Their depth (length of box) becomes smaller as more data accumulates and 
uncertainty of estimation of the changepoint and the degradation rate, T|, is reduced. 
But the depth of the boxes, plus the whiskers, provide perspective on the risk of 
changing soon, or waiting. Apparently the chance of making the wrong decision 
decreases if the decision maker waits, but also the value of making the more nearly 
correct decision decreases, for there is less time to the horizon. 
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It is important to be clear that the costs compared are estimated mean or 
expected costs, and not projected total costs, as there might be experienced during a 
future period. Boxplots that exhibit the probable range of these can also be exhibited. 
These more nearly represent true risk associated with actual return on investment. 

Finally, note that all present calculations ultimately assume that the basic 
model is correct, or a good approximation. It may well be reasonable to check 
historical data for the approximate way in which degradation occurs -- it need not be 
a simple ramp of slope T|, but perhaps precision of specification does not matter. 
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APPENDIX D 



BAYES APPROACH TO CHANGEPOINT ECONOMICS 

An enhanced version of the basic model presented in Appendix A is obtained 
by assuming that the changepoint (time of onset of degradation) is a random 
variable, C, with specified distribution whose parameter is unknown and subject to 
a probability density, 7i( ). Specifically, suppose 

P{C = k} = (\-pf-^p, (D.l) 

i.e. is geometric, and that the prior 7i( ) is beta. As t advances one observes 
and so in effect one has noisy observations on the outcomes of a 
biased coin flip with unknown success probability p. 

We also generalize the normal model (A.l) to incorporate more general 
known trend functions than linear; in this Appendix, suppose that X-observations 
have the following structure: 

a) Xi, X2,...,Xc are independent and identically distributed normal random 
variables with mean p and variance g^, while 

b) Xc+1, Xc+2/ "/ Xt are independent with Xi normally distributed with 
variance o^ and mean p+T)g(i,C); that is, 

X,-Nip,o^) ifl<j<C; 

-N{p + gU,C)7i,o") ifC + l<i. 

The function g is an arbitrary nonnegative nondecreasing function with g(i,C)=0 for 
i<C representing the known form of the degrading trend which occurs after the time 
of onset of degradation. For the model of Appendices A and C, g(i,C)=(i-C)'*', a ramp 
starting at time C. 

Putting (uninformative) priors on and p, it is shown that the joint 

posterior density of these is straightforwardly obtained; the parameter is initially 
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estimated from residuals without using a fully Bayes approach. In principle all of 
the above could be carried out for any arbitrary, but reasonable, discrete distribution 
that might better represent what is known about the changepoint process. A 
similiar statistical model was used by Smith(1975). 

In what follows we sketch the development. Suppose that X-observations, 
are available up to time t, it follows that 



p[p&{dp),C = k,p e (dp),rie (dri) ,X, e {dx{),...,X- e (dXj),...X, e (dx,)} 
= n(p)0 - P)*~‘ pH ^gU>lc)f'^dpdT]dp 



for k<t (D.3) 



= - p)‘^^dpdT]dp 



for k=t+l 



The term involving (1-p)* represents the case in which no changepoint has 
occurred; we will set k=t+l for this case. 

By a completion of squares process one can xvrite the likelihood function for 
given C=k as a bivariate normal density with parameters dependent on k and data 
up to t; the exponential term of the likelihood is written as 



-p - 7]g(j,k)f^ 



= c exp-^- 



1 



2(l-p^) 






V 



-X(k,t) 



(D.4) 



r yo 

for l<k<t-l; for k>t we have no changepoint so the exponential term of the 
likelihood is of the form 



29 



nexp{-^U,-^)>} 



= c exp<- 



1 Ui-Jif 



-K{k,t)\ 



2 y J (D.5) 

where in the above c is a constant, and the parameters all depend upon k,t, and x{t), 
the data up to time t. 

For k<t, the parameters of the bivariate normal (D.4) turn out to be 






(D.6) 



82 



(D.7) 



7 = 



82 



82~(8i) t 



(D.8) 



and 



where 



82~{8x) t 



P(M) = — 

V^2 






I ;=1 



(D.9) 



(D.IO) 



(D.ll) 



2 < 

c, =x^(k,t) = -'^XjgU,k) 
^ 7=1 



(D.12) 






8i=8i(k,t) = -^ g{j,k)-, 






(D.13) 
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(D.14) 



^2 =^ 2(^.0 = -X 

^ y=l 

and 

K{k,t) = -mk,t)-riikJ)8ij,k)f . (D.15) 

For the case k>t 

= (D.16) 

^ ;=1 



yHk,t) = —; 

t 



and 



(D.17) 



(D.18) 



Tj{k,t) = 0, v^{k,t) = 0, andp(k,t) = 0. These values can be derived directly from (D.4) 
and (D.5); details are omitted. 

If the bivariate normal form is utilized in (D.3) and the integration is the 

integration is performed over p \ve obtain the joint conditional density of C, p, and t| 

given the data and in the form 
P{C = k,^ € (d^y r\ e {dr]) U(/), <j ^ } 



= 7t{k,t) 



1 






2k{\- p ) yu 



2(l-p^)L y 



(P-Pf I I 

Jj (D.19) 



for k<t where 

7i\k,t) = yu£(l - p)*'' pKip)dptxp{-K{k,t)); 



for k>t 



P[C=k,p€idp)\x{r),o^} 



with 



(D.20) 



(D.21) 
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n (f, i) = c y£(l - p)"' pn(p)dp txp{-K (r,r» (D.22) 

7[\t + l,f) = c* - p)' 7t(p)dpcxp{-K{i + 1,0) (D.23) 

and 

f+1 

c* = i^K*(k,t)y^ (D.24) 

t=i 

Note that {7i*(k,t), k<t} is the marginal probability that the changepoint occurs at any 
time k up to and including t; while 7t*(t+l,t) is the posterior probability that no 
changepoint has occurred up to time t; that is, 

n(k,t)=P{C = k\X,=x,,...,X, = x,} fork<t (D.24a) 

nit + lt) = P[C>t\X,=x^,...,X,=x,}. (D.24b) 

For each time t, the estimate of is computed from the squared residuals for 
each possible value of C=k in the following manner; let 

a\kj) = -^'^(x^-pik,t)-mj)8ij\k)f ifk<t (D.25) 

a\kj) = -^fix-p(k,t)f ifk>t. (D.26) 

Finally, the estimate of the variance based on data is 



a^{t) = '^7i*ik,t)a^ik,t). (D.27) 

Given C=k, k<t, and the data x^,...,x,, the posterior distribution of (|i,T|) is 
bivariate normal with mean (JIik,t),fjik,t)), variance of p equal to variance of 

Ti equal to t)2(k,t), and correlation p(k,t); for k=t,t+l, T|=0 and the posterior 
distribution of p is normal with mean JI{k,t) and variance y2(k,t). Hence, given the 
data x^,...,x,, the posterior distribution of (p,T|) is a mixture of bivariate normal 
distributions with mixture distribution {7c*(k,t), k<t+l}. 
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At time t, the future mean cost of a jX)licy that switches to the new system t 
time units in the future is given by (C.l) with (t-C)'*' replaced by g(t,C); the mean cost 
of remaining with the current system is given by (C.2) with (t-C)+ replaced by g(t,C). 
Since the mean costs are linear in p and ri, the posterior distribution of the future 
mean cost given the data is a mixture of normal distributions with mixture 

distribution {7i*(k;t), k<t+l). The variability of the estimated future mean cost for 
each policy can be evaluated either by computing the percentiles of the posterior 
distribution or by simulating the posterior distribution. 

Figures 9-11 present results of using the Bayesian analysis presented in this 
Section to obtain information concerning the cost effectiveness of switching to the 
new system. The same data set which illustrated the procedures based on 
maximum likelihood in Appendix C is used; the data are presented in Figure 1. 
Since the bivariate normal has 5 parameters to be estimated, the estimation 
procedure begins with data The initial estimate of is 

= (D.28) 

^ ;=1 

where x is the sample average of the first 5 data points. For each time t, estimates of 
the posterior distribution are obtained from equations (D.6)-(D.27). The updated 
estimate of is used as input for the calculations for the next time period. 

Figure 9 presents the times to upgrade the subsystem which minimize the 
expected posterior mean cost for each decision time t=6,...29. On the x-axis appears 
the "current" time t. On the y-axis is the time to upgrade corresponding to the 
minimum expected mean cost policy using the posterior distribution given data 
Comparison with Figure 2 indicates the following differences between the 
optimal Bayes policies and the optimal maximum likelihood (ml) policies presented 
in Appendix C. The Bayes policy for t=7 is still to upgrade; however, the time to 
upgrade is later (time 13) than the ml policy (which says to upgrade at time 9). The 
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Bayes policy at time 12 is to upgrade at time 16 rather than time 15 for the ml policy. 
The Bayes policies and the ml policies are the same for the other times. Thus, the 
Bayes policy is more conservative than the maximum likelihood policy when there 
is a change in the p>olicy from never upgrading to upgrading at some time. 

Simulation is used to obtain information concerning the variability of the 
posterior distribution of the average cost of each policy. For each current time t, a 
realization of (C,|i,T|) is simulated from the posterior distribution and the average 
costs for each policy computed. Figures 10-11 present boxplots of 100 replications of 
the simulated difference in estimated cost between a policy that switches to the new 
system at each future time and the policy that never switches. The y-axis represents 
the possible values of the cost differences. The x-axis represents the different 
possible times to switch. 

Figure 10 presents the boxplots for simulated average policy cost differences 
using the posterior distribution at the current time of t=7. Figure 3 presents a 
similiar picture for bootstrap replications of estimated average policy cost differences 
using the maximum likelihood estimates at t=7. Comparison of the two figures 
indicates that the Bayes estimates of average cost difference are much larger and can 
be positive some of the time; recall that a positive difference implies that it is better 
never to change. Hence, the Bayes estimates are providing much less evidence of 
the need to change to the new system. Recall that the changepoint does not occur 
until time 10. Hence, if the true model parameters were known, the best policy at 
time 7 would never change. 

Figure 11 presents results for current time t=12. Comparing this figure with 
the corresponding maximum likelihood figure. Figure 5, indicates that there is little 
practical difference between the two procedures in this case also. However, the 
widths of the boxes for the Bayesian procedure are larger than those for the 
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maximum likelihood procedure. The greater widths are an indication of greater 
uncertainty concerning the future average costs for each policy. As a result, the 
Bayesian procedure is providing less evidence of the need to upgrade. This greater 
variability is once again probably due to the Bayes procedure assessing greater 
uncertainty to the time of onset of degradation. This suggests that the Bayesian 
procedure may be more cautious than the maximum likelihood procedure. 
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GLOSSARY 



H: mean number of failures per time p>eriod for current subsystem before 
degradation 

T|: multiple of degrading trend for current subsystem 

o^: variance of number of failures per time period for current subsystem 

C: time of onset of degrading trend for current subsystem 

Xi: number of failures occurring in time period i for current subsystem 

X: mean number of failures per time period for upgraded subsystem 

Co : cost per failure for current subsystem 

Cf : initial fixed cost for upgrading current subsystem 

c^: cost per failure for upgraded subsytem 
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Figure 2 
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TIME OF CHANGE TO NEW SYSTEM 
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BOOT(AV COST FOR EACH POLICY-AV COST NO CHANGE) 
POLICIES BASED ON ESTIMATES USING X1-X25;TIME 25 
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SIMULATIONS OF POSTERIOR GIST GIVEN XI -X7 
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FIGURE 11 
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